CLAIMS 

What is claimed is: 

1. A speech-based processing system comprising: 

a database of PIM data associated with a user; 
a set of language models; 

a learning unit to provide language models of said set of language models 
based on the PIM data by employing a language model learning algorithm; 

a recognition server to recognize an utterance of the user by using one of 
the language models; and 

a speech application to identify and access a subset of the PIM data 
specified by the utterance by using a result of recognizing the utterance. 

2. A speech-based processing system as recited in claim 1, wherein the utterance 
comprises a short reference to said subset of the PIM data, said short reference 
consisting of less than all of said subset of the PIM data. 

3. A speech-based processing system as recited in claim 1, wherein the learning 
unit is configured to employ the language model learning algorithm periodically 
to provide language models based on the PIM data. 

4. A speech-based processing system as recited in claim 1, wherein the learning 
unit is configured to employ the language model learning algorithm on-the-fly to 
provide a language model based on the PIM data during a session with the user. 

5. A speech-based processing system as recited in claim 1, wherein the language
model learning algorithm uses grammar induction. 

6. A speech-based processing system as recited in claim 1, wherein the language 
model learning algorithm trains statistical language models. 

7. A speech-based processing system as recited in claim 1, wherein the speech 
application and the learning unit comply with a common set of APIs designed 
for accessing PIM data. 

8. A speech-based processing system as recited in claim 1, wherein the PIM data 
is provided in one or more XML documents. 

9. A speech-based processing system as recited in claim 1, wherein the learning 
unit comprises a plurality of modules, each module containing a set of heuristics 
tailored for acquiring language models for one of a plurality of types of PIM 
data. 

10. A speech-based processing system as recited in claim 9, wherein the plurality 
of types of PIM data comprises personal address book information. 

11. A speech-based processing system as recited in claim 9, wherein the plurality 
of types of PIM data comprises personal calendar information. 

12. A speech-based processing system as recited in claim 9, wherein the plurality
of types of PIM data comprises information from email messages of the user.

13. A speech-based processing system comprising:

a database of PIM data associated with a user;

a set of language models;

means for employing a language model learning process to provide
language models of said set of language models based on the PIM data;

means for recognizing an utterance of the user based on one of the
language models, the utterance comprising a short reference to a subset of the
PIM data; and

means for identifying and accessing the subset of the PIM data specified
by the utterance based on a result of recognizing the utterance.

14. A speech-based processing system as recited in claim 13, wherein the means
for employing a language model learning process comprises a plurality of
modules, each module containing a set of heuristics tailored for acquiring
language models for one of a plurality of types of PIM data.

15. A speech-based processing system as recited in claim 13, wherein the means
for employing a language model learning process comprises means for
periodically providing language models based on the PIM data.

16. A speech-based processing system as recited in claim 13, wherein the means
for employing a language model learning process comprises means for providing 
a language model based on the PIM data on-the-fly during a session with the 
user. 

17. A speech-based processing system as recited in claim 13, wherein the 
language model learning process uses grammar induction. 

18. A speech-based processing system as recited in claim 13, wherein the 
language model learning process trains statistical language models. 

19. A speech-based processing system comprising: 

a database of PIM data associated with a user; 

a set of language models; 

a language model server including 

a learning unit to provide the set of language models based on the 
PIM data by employing a language model learning algorithm, and 

a look-up unit to look-up and select one of the language models 
based on a specified identity of the user; 

a recognition server to recognize an utterance of the user according to the
selected language model; and 

a speech application to trigger operation of the look-up unit and to
identify and access a subset of the PIM data specified by the utterance using a
result of recognizing the utterance.

20. A speech-based processing system as recited in claim 19, wherein the
utterance comprises a short reference to said subset of the PIM data, said short
reference consisting of less than all of said subset of the PIM data.

21. A speech-based processing system as recited in claim 19, wherein the
learning unit comprises a plurality of modules, each module containing a set of
heuristics tailored for one of a plurality of types of PIM data.

22. A speech-based processing system as recited in claim 19, wherein the
learning unit is configured to employ the language model learning algorithm
periodically to provide language models based on the PIM data.

23. A speech-based processing system as recited in claim 19, wherein the
learning unit is configured to employ the language model learning algorithm
on-the-fly to provide a language model based on the PIM data during a session
with the user.

24. A speech-based processing system as recited in claim 19, wherein the
language model learning algorithm uses grammar induction.

25. A speech-based processing system as recited in claim 19, wherein the
language model learning algorithm trains statistical language models.

26. A speech-based processing system as recited in claim 19, wherein the speech
application and the learning unit comply with a common set of APIs for
accessing PIM data.

27. A speech-based processing system as recited in claim 1, wherein the PIM
data is provided in one or more XML documents.

28. A speech-based processing system comprising:

processor means for executing software; and

storage means accessible to the processor means for storing software, the
storage means having stored therein a learning unit to learn a set of language
models based on a set of PIM data and a speech application to access a subset of
the PIM data specified by a short reference to said subset uttered by a user, by
using a result of recognizing the utterance.

29. A speech-based processing system as recited in claim 28, wherein the
learning unit comprises a plurality of modules, each module containing a set of
heuristics tailored for acquiring language models for one of a plurality of types
of PIM data.

30. A method of facilitating speech recognition comprising:

using an automated language model learning process to acquire a set of
language models based on PIM data associated with a user;

recognizing an utterance by the user by using one of the language models;
and

using the recognized utterance of the user to identify and access a subset
of the PIM data.

31. A speech-based processing system as recited in claim 30, wherein the
utterance comprises a short reference to said subset of the PIM data, said short
reference consisting of less than all of said subset of the PIM data.

32. A method as recited in claim 30, wherein said using an automated language
model learning process comprises using a plurality of modules of the automated
language model learning process to acquire the set of language models based on
PIM data associated with the user, wherein each module contains a set of
heuristics tailored for a particular type of PIM data.

33. A method as recited in claim 30, wherein said using an automated language
model learning process comprises periodically providing language models based
on the PIM data.

34. A method as recited in claim 30, wherein said using an automated language
model learning process comprises providing a language model based on the PIM
data on-the-fly during a session with the user.

35. A method as recited in claim 30, wherein the language model learning
process uses grammar induction. 

36. A method as recited in claim 30, wherein the language model learning 
process trains statistical language models. 

37. A method of facilitating speech recognition comprising: 

using an automated language model learning process to acquire a set of 
language models based on PIM data associated with a user; 

recognizing an utterance by the user by using one of the language models; 

and 

using a speech application to identify and access a subset of the PIM data 
based on the recognized utterance. 

38. A method as recited in claim 37, wherein the utterance comprises a short 
reference to said subset of the PIM data, said short reference consisting of less 
than all of said subset of the PIM data. 

39. A method as recited in claim 38, wherein said using an automated language 
model learning process comprises using a plurality of modules of the automated 
language model learning process to acquire the set of language models based on 
PIM data associated with the user, wherein each module contains a set of 
heuristics tailored for one of a plurality of types of PIM data. 

40. A method as recited in claim 38, wherein said using an automated language
model learning process comprises periodically providing language models based
on the PIM data.

41. A method as recited in claim 38, wherein said using an automated language
model learning process comprises providing a language model based on the PIM
data on-the-fly during a session with the user.

42. A method as recited in claim 38, wherein the language model learning
process uses grammar induction.

43. A method as recited in claim 38, wherein the language model learning
process trains statistical language models.

44. A speech-based processing system as recited in claim 38, wherein the
automated language model learning process and the speech application are each
compliant with a set of APIs for accessing PIM data.

45. A speech-based processing system as recited in claim 38, wherein the PIM
data is provided in one or more XML documents.